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Abstract 

The relative entropy description of Holevo-Schumacher- Westmoreland (HSW) clas- 
sical channel capacities is applied to single qubit quantum channels. A simple formula 
for the relative entropy of qubit density matrices in the Bloch sphere representation is 
derived. The formula is combined with the King-Ruskai-Szarek- Werner qubit channel 
ellipsoid picture to analyze several unital and non-unital qubit channels in detail. An 
alternate proof is presented that the optimal HSW signalling states for single qubit 
unital channels are those states with minimal channel output entropy. The derivation 
is based on symmetries of the relative entropy formula, and the King-Ruskai-Szarek- 
Werner qubit channel ellipsoid picture. A proof is given that the average output density 
matrix of any set of optimal HSW signalling states for a { qubit or non-qubit ) quantum 
channel is unique. 
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1 Introduction 



In 1999, Benjamin Schumacher and Michael Westmoreland published a paper entitled 
Optimal Signal Ensembles [|| that elegantly described the classical (product state) channel 
capacity of quantum channels in terms of a function known as the relative entropy. Build- 
ing upon this view, we study single qubit channels, adding the following two items to the 
Schumacher- Westmoreland analysis . 

I) A detailed understanding of the convex hull shape of the set of quantum states output 
by a channel. (The fact the set was convex has been known for some time, but the detailed 
nature of the convex geometry was unknown until recently.) 

II) A useful mathematical representation (formula) for the relative entropy function, 
P( p II ), when both p and (j) are single qubit density matrices. 

For single qubit channels, the work of King, Ruskai, Szarek, and Werner has provided 
a concise description of the convex hull set||2|, ^. In this paper, we derive a useful formula 
for the relative entropy between qubit density matrices. Combining this formula with 
the KRSW convexity information, we present from a relative entropy perspective several 
results, some previously known, and others new, related to the (product state) classical 
channel capacity of quantum channels. These include : 

I) The average output density matrix for any optimal set of signalling states that achieves 
the maximum classical channel capacity for a quantum channel is unique. For single qubit 
unital channels, Donalds equality leads to a symmetry which tells us this average density 
matrix must be ^ I. This fact about the average density matrix allows us to conclude for 
unital qubit channels that the optimum signalling states are a subset of the states with 
minimum output von Neumann entropy, as previously shown in |2| . This symmetry also 
allows us to see why only two orthogonal signalling states are needed to achieve the optimum 
classical channel capacity for single qubit unital channels, and why the a priori probabilities 
for these two signalling states are ^. 

II) The single qubit relative entropy formula allows us to understand geometrically why 
the a priori probabilities for optimum signalling states for non-unital single qubit channels 
are not equal. 

III) Examples of channels which require non-orthogonal signalling states to achieve 
optimal classical channel capacity are given. Such channels have been found before. Here 
these channels are presented in a geometrical fashion based on the relative entropy formula 
derived in Appendix A. 

2 Background 
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2.1 Classical Communication over Classical and Quantum Channels 
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Figure 1: Transmission of classical information through a quantum channel. 



This paper discusses the transmission of classical information over quantum channels 
with no prior entanglement between the sender (Alice) and the recipient (Bob). In such a 
scenario, classical information is encoded into a set of quantum states tpi. These states are 
transmitted over a quantum channel. The perturbations encountered by the signals while 
transiting the channel are described using the Kraus representation formalism. A receiver 
at the channel output measures the perturbed quantum states using a POVM set. The 
resulting classical measurement outcomes represent the extraction of classical information 
from the channel output quantum states. 

There are two common criteria for measuring the quality of the transmission of classical 
information over a channel, regardless of whether the channel is classical or quantum. These 
criteria are the (Product State) Channel Capacity[Q, |5|, ^] and the Probability of Error 
(Pe).§ In this paper, we shall focus on the first criterion, the Classical Information Capacity 
of a Quantum Channel, C. 

In determining the classical channel capacity, we typically have an input signal constel- 
lation consisting of classical signals XiQ. The classical channel capacity C is defined as 

C = y^^'^{all possible Xi} H{X) - H{X\Y) 

Here H[X) is the Shannon entropy for the discrete random variable X. 
^ = { Pi = prob{xi) } , i = 1 , ■ ■ ■ , N . The Shannon entropy H{X) is defined as 
H{X) = — Y^f=i Pi \og{pi ). For conditional random variables, we denote the probability 
of the random variable X given Y as p{X\Y). The corresponding conditional Shannon 
entropy is defined as H{X\Y) = — Y^^=i Tl^Zi Pi^i Vj) ^oE.[p{xi\yj )]■ Our entropy 
calculations shall be in bits, so log2 is used. 

Suppose we have \X\ linearly independent and equiprobable input signals Xi, and possi- 
ble output signals yj, with \ Y\ > \X\. If there is no noise in the channel, then C = log(|X|). 
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Noise in the channel increases the uncertainty in X given the channel output Y, and thus 
noise increases H{ X \Y), thereby decreasing C for fixed H{X). Geometrically, the presence 
of random channel noise causes the channel mapping Xj yj to change from a noiseless 
one-to-one relationship, to a stochastic map. We say the possible channel mappings of Xi 
diffuse, occupying a region 6i instead of a single unique state yj. As long as the regions 
6i have disjoint support, the receiver can use Y to distinguish which X was sent. In this 
disjoint support case, H{X | y) ~ and C ~ H{X). This picture is frequently referred 
to as sphere packing, since we view the diffused output signals as roughly a sphere around 
the point in the output space where the signals would have been deposited had the channel 
introduced no perturbations. The greater the channel noise, the greater the radius of the 
spheres. If these spheres can be packed into a specified volume without significant overlap, 
then the decoder can distinguish the input state transmitted by determining which output 
sphere the decoded signal falls into. 

For sending classical information over a quantum channel, we adhere to the same picture. 
We seek to maximize H{X) and minimize H{X\Y in order to maximize the channel 
capacity C. We encode each classical input signal state { Xj } into a corresponding quantum 
state V'i- Sending tpi through the channel, the POVM decoder seeks to predict which Xi was 
originally sent. Similar to the classical picture, the quantum channel will diffuse or smear 
out the density matrix pi corresponding to the quantum state as the quantum state 
passes through the channel. The resulting channel output density matrix £{pi) will have 
support over a subspace (pi. As long as all the regions (pi have disjoint support, the POVM 
based decoder will be able to distinguish which quantum state pi entered the channel, and 
hewe H{X\Y) ^ 0, yielding C w H{X). 

For the classical capacity of quantum channels, we encode classical binary data into 
quantum states. The product state classical capacity for a quantum channel maximizes 
channel throughput by encoding a long block of m classical bits Xi into a long block con- 
sisting of a tensor product of n single qubit quantum states ipj in an optimal manner which 
maximizes (product state) classical channel capacity. 

{xi,X2, ••• Xm] Ipl 1p2 ■ ■ ■ ^ Ipn 

It has been widely conjectured, but not proven, that the product state classical channel 
capacity of a quantum channel is the classical capacity of a quantum channel. 

The Holevo-Schumacher- Westmoreland Theorem tells us that the classical product state 
channel capacity using the above encoding scheme is given by the Holevo quantity X of the 
output signal ensemble, maximized over a single copy of all possible input signal ensembles 



^1 ^^'^^{all possible Pi and Pi} 



output 



possible Pi and pi} 
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S{—) above is the von Neumann entropy. The symbol £{p) represents the output density 
matrix obtained by presenting the density matrix p at the channel input. Furthermore, the 
input signals pi can be chosen to be pure states without affecting the maximization |^]. Here- 
after we shall call Ci defined above the Holevo-Schumacher- Westmoreland (HSW) channel 
capacity. 



2.2 Relative Entropy and HSW Channel Capacity 

An alternate, but equivalent, description of HSW channel capacity can be made using 
relative entropy Q. The relative entropy V of two density matrices, g and (j), is defined as 

1^{q\\4>) = Tr[Qlog{g) - £»log((/>)] 

Here Tr[-] is the trace operator. Klein's inequality tells us that T) > 0, with P = iff 
g = </> Note that we shall usually take our logarithms to be base 2. 

To see how to represent x iii terms of P, consider the optimal signalling state ensemble 
{Pk , Qk = S{^k) }■ Define g as J2k Pk Qk- Consider the following sum : 

J2 Pk'^iQkWo) = iPk Tr[gk \og{ gk )] - Pk Tr[ gt \og{g) ]} 
k k 



^ {pk Tr [ gk log( gk)]} - Tr 



^ {pk gk log(e') } 



k Ik 

= {Pk Tr[ gk log( gk)]} - Tr[g log{g) ] = S{ g) - ^ Pk Qk) = X 
k k 

Thus, the HSW capacity Ci can be written as 

Ci = Ma.X[all possible {pk , ^i,}] J2 Pk'^ (^(^k)\\£iv)) 

k 

where the ipk are the quantum states input to the channel and (p = J2k Pk^k- We call an 
ensemble of channel output states {pk , gk = £{Pk) } an optimal ensemble if this ensemble 
achieves Ci. Schumacher and Westmoreland proved the following five properties related to 
optimal ensembles Q. 

I) T>{gk\\ g) = Ci Mgk in the optimal ensemble, and g = J2Pk Qk- 

II) V{^\\ g) < Ci where {pk, gk = £{pk) , Q = J2Pk Qk } an optimal ensemble, 
and ^ is any permissible channel output density matrix. 
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Ill) There exists at least one optimal ensemble {pk , Qk = £[^k) } that achieves Ci. 



IV) Let A be the set of possible channel output states for a channel £ corresponding to 
pure state inputs. Define B as the convex hull of the set of states A. Then for q ^ A and 
^ G S = the convex hull of we have Q : 



Ci = Min^ Max^ V{Q\\i) 



V) For every S, that satisfies the minimization in IV) above, there exists an optimum 
signalling ensemble {pk Pk} such that ^ = X^fc Pk Pk- 



2.3 The King - Ruskai - Szarek - Werner Qubit Channel Representation 

In this paper, we are primarily concerned with qubit channels, namely <?((/?) = Q, where 
and Q are qubit density matrices. Several authors ^ have developed a nice picture of 
single qubit maps. Recall that single qubit density matrices can be written in the Bloch 
sphere representation. Let the density matrices q and </? have the respective Bloch sphere 
representations : 

9? = ^ (2: + V\)<^ • a) and g = ^{I + W^,a) 

The symbol a means the vector of 2 x 2 Pauli matrices 



a 



ay 



where 



1 

1 



-i 

1 



The Bloch vectors W are real three dimensional vectors that have magnitude equal to 
one when representing a pure state density matrix, and magnitude less than one for a mixed 
(non-pure) density matrix. 

The King - Ruskai et al. qubit channel representation describes the channel as a mapping 
of input to output Bloch vectors. 



1 




-1 








- 




- 1 - 


Wx 
















Wx 


Wy 




ty 





\y 







Wy 














A. . 




-Wz- 



All qubit channels have such a representation. The representation is unique up to a 
unitary rotation, and hence requires a choice of basis. The tk and are real parameters 

^This result was originally derived in pOl. 
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which must satisfy certain constraints in order to ensure the matrix above represents a 
completely positive qubit map. ( Please see King - Ruskai for more detailsQ. ) 

Prom the King - Ruskai et al. qubit channel representation, we see that Wk = tk + 
\k Wfc or 



It has been shown that Ci can always be achieved using only pure input states [p. 
Therefore, all input signalling Bloch vectors obey || VV || = 1. Thus || VV |p = 1, and 
II >V ||2 = 1 = W2 + Wl + implies 

The set of possible channel output states we shall be interested in is the set of channel 
outputs corresponding to pure state channel inputs. This set of states was defined as A in 
section 2.2, and is the surface of the ellipsoid shown above. The convex hull of the set of 

states A is the solid ellipsoid defined as W such that 

3 Relative Entropy In The Bloch Sphere Representation 

The key formula we shall use extensively is the relative entropy in the Bloch sphere repre- 
sentation. Here p and (j) have the respective Bloch sphere representations : 

p = ^ (J + W . a) = i (J + V . a) 

We define cos{6) as : 



cos{6) = where r = \/yV»W and q = \/V»V 

r q 

In Appendix A, we prove the following formula for the relative entropy 2?( ^> || "0 ) of two 
single qubit density matrices g and ip with Bloch sphere representations given above. 



1 / \ r /l + rXl / \ VV • V 



1 + 



1 / 2\ ^1 fl + r\ 1 / 2\ ^ cos{9) /I + 

2 log2 (l - r') + 2 log2 [y^] - 2 l°g2 [I-q') ^ log^ 
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where 9 is the angle between W and V, and r and q are as defined above. 

When ^ in is the maximally mixed state = ^X, we have g = 0, and 

I'(/9 II ^) becomes the radially symmetric function 

P(p||<^) = v(^p\\\l^ = \ log2(l-r2) + \ log2(^) = 1 

It is shown in Appendix A that I? (^p|| ^ = 1 — where S{p) is the von 

Neumann entropy of p. In what follows, we shall often write I?( p || ^ ) as I?( VV || V ), where 
VV and V are the Bloch sphere vectors for p and (f) respectively. 

In what follows, we shall graphically determine the HSW channel capacity from the 
intersection of contours of constant relative entropy with the channel ellipsoid. To that 
end, and to help build intuition regarding channel parameter tradeoff's, it is advantageous 
to obtain a rough idea of how the contours of constant relative entropy P( p || (/> ) behave, for 
fixed ^, as p is varied. Furthermore, it will turn out that due to symmetries in the relative 
entropy, we frequently will only need to understand the relative entropy behavior in a plane 
of the Bloch sphere, which we choose to be the Bloch X-Y plane. In Figure 2, we plot a 
few contour lines for T>{p = ^ T) in the X-Y Bloch sphere plane. In the figures that 
follow, we shall mark the location of 4> with an asterisk. The contour values for P( p || ^ ) are 
shown in the plot title. The smallest value of T){p corresponds to the contour closest 
to the location of (j). The largest value of T){p || 0) corresponds to the outermost contour. 
For ^ = the location of (p is the Bloch sphere origin. 
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Figure 2: Contours of constant relative entropy D(/3||(/)) as a function of p in the Bloch 
sphere X-Y plane for the fixed density matrix cf) = \ T. 

As an example of how these contour lines change as (j) moves away from the maximally 
mixed state = ^ X, or equivalently as q becomes non-zero, we give contour plots below 

ro 

for g 7^ 0. We lei (f) = \ {T + q Gy } with corresponding Bloch vector V = q 

[o. 

The asterisk in these plots denotes the location of V. The dashed outer contour is a radius 
equal to one, indicating where the pure states lie. 
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q = 0.1, D(Rho||Phi) = 0.01, 0.05, 0.1, 0.15, 0.2, 0.25, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0 




Figure 3: Contours of constant relative entropy as a function of p in the Bloch 

sphere X-Y plane for the fixed density matrix cf) = \ {T + Q.lay}. 




q = 0.3, D(Rho||Phi) = 0.01, 0.05, 0.1, 0.15, 0.2, 0.25, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0 




Figure 5: Contours of constant relative entropy as a function of p in the Bloch 

sphere X-Y plane for the fixed density matrix cf) = i { X + 0.3 ctj^ }. 



q = 0.4, D(Rho||Phi) = 0.01 , 0.05, 0.1 , 0.1 5, 0.2, 0.25, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1 .0 




Figure 6: Contours of constant relative entropy T>{p\\(f)) as a function of p in the Bloch 
sphere X-Y plane for the fixed density matrix ^ = \ {T + QAay}. 
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q = 0.6, D(Rho||Phi) = 0.01 , 0.05, 0.1 , 0.1 5, 0.2, 0.25, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1 .0 




Figure 8: Contours of constant relative entropy T>{p\\(f)) as a function of p in the Bloch 
sphere X-Y plane for the fixed density matrix ^ = \ {T + Q.Qay}. 
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q = 0.7, D(Rho||Phi) = 0.01, 0.05, 0.1, 0.15, 0.2, 0.25, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0 




Figure 9: Contours of constant relative entropy as a function of p in the Bloch 

sphere X-Y plane for the fixed density matrix cf) = i { X + Q.l Gy }. 



q = 0.8, D(Rho||Phi) = 0.01 , 0.05, 0.1 , 0.1 5, 0.2, 0.25, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1 .0 




Figure 10: Contours of constant relative entropy as a function of p in the Bloch 

sphere X-Y plane for the fixed density matrix ^ = \ {T + Q.Say}. 
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q = 0.9, D(Rho||Phi) = 0.01, 0.05, 0.1, 0.15, 0.2, 0.25, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0 




-i^ -0.5 ^ ^05 




Figure 11: Contours of constant relative entropy as a function of p in the Bloch 

sphere X-Y plane for the fixed density matrix cp = ^ {I + 0.9 }. 

The two dimensional plots oiV{p\\4>) shown above tell us about the three dimensional 
nature of P( p || </> ). To sec why, first note that we can always rotate the Bloch sphere X-Y-Z 
axes to arrange for = V ^ g to lie on the Y axis, as the density matrices (f) are shown 
in Figures 2 through 11 above. 

Second, recall that our description of is a function of the three variables 

{ r , q, 9} only, which were defined above as the length of the Bloch vectors corresponding 
to the density matrices p and cf) respectively, and the angle between these Bloch vectors. 

npU) = fir,q,e) 

This means the two dimensional curves of constant P(p || 0) can be rotated about the Y 
axis as surfaces of revolution, to yield three dimensional surfaces of constant T>{p\\(j)). (In 
these two and three dimensional plots, the first argument of P( p || ), p, is being varied, 
while the second argument, (f), is being held fixed at a point on the Y axis.) 

Our two dimensional plots above give us a good idea of the three dimensional behavior 
of the surfaces of constant relative entropy about the density matrix (p occupying the second 
slot in P( • • • II • • • ). A picture emerges of slightly warped "eggshells" nested like Russian 
dolls inside each other, roughly centered on cp. A mental picture of the behavior of !?( p || ) 
is useful because in what follows we shall superimpose the KRSW channel ellipsoid (s) onto 
Figures 2 through 11 above. By moving V (the asterisk) around in these pictures, we 
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shall adjust the contours of constant /O || (p), and thereby graphically determine the HSW 
channel capacity, optimum (output) signalling states, and corresponding a priori signalling 
probabilities. The resulting intuition we gain from these pictures will help us understand 
channel parameter tradeoffs. 

4 Linear Channels 

Recall the KRSW specification of a qubit channel in terms of the six real parameters 
{ tx , ty , tz , , , } as defined on page |^ of this paper. A linear channel is one 
where = Xy = 0, but ^ 0. The shift quantities can be any real number, up 
to the limits imposed by the requirement that the map be completely positive. For more 
details on the complete positivity requirements of qubit maps, please see |^. 

A linear channel is a simple system that illustrates the basic ideas behind our graph- 
ical approach to determining the HSW channel capacity Ci. Recall the relative entropy 
formulation for Ci. 

Cl = MaX[a;;po^3jfc,e{p^ _^^|] ^ Pk V { £ { Ifk) \\ £ { If) ) 

k 

where the ipk are the quantum states input to the channel and 99 = ^jr. pk (fk- We call an 
ensemble of states {pk , Qk = ^iv^k) } an optimal ensemble if this ensemble achieves Ci. 

As discussed on page ^, Schumacher and Westmoreland showed the above maximization 
to determine Ci is equivalent to the following min-max criterion : 

Cl = Min^ Max^, ViokW^) 

where is a density matrix on the surface of the channel ellipsoid, and is a density 
matrix in the convex hull of the channel ellipsoid. For the linear channel, the channel 
ellipsoid is a line segment of length 2 centered on { tx , ty , tz }■ Thus, both and (j) 
must lie somewhere along this line segment. Furthermore, Schumacher and Westmoreland 
tell us that (p must be expressible as a convex combination of the g^ which satisfy the above 
min-max Q. 

To graphically implement the min-max criterion, we overlay the channel ellipsoid on the 
contour plots of relative entropy previously found. We wish to determine the location of 
the optimum (p and the optimum relative entropy contour that achieves the min-max. The 
generic overlap scenarios are shown below, labeled Cases 1-5. From our plots of relative 
entropy, we know that contours of relative entropy are roughly circular about (p. We denote 
the location of (p below by an asterisk ( * ). The permissible g^ are those density matrices 
at the intersection of the relative entropy contour and the channel ellipsoid, here a line 
segment. 

Let us examine the five cases shown below, seeking the optimum (p and the relative 
entropy contour corresponding to Ci (the circles below), by eliminating those cases which 
do not make sense in light of the minimization-maximization above. 
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Case 1 



Case 2 



Case 3 



Case 4 Case 5 



Figure 12: Scenarios for the intersection of the optimum relative 
entropy contour with a linear channel ellipsoid. 



Case 1 is not an acceptable configuration because (p does not lie inside the channel 
ellipsoid, meaning for the linear channel, (j) does not lie on the line segment. Case 2 is not 
acceptable because there are no permissible g^, since the relative entropy contour does not 
intersect the channel ellipsoid line segment anywhere. Case 3 is not acceptable because 
Schumacher and Westmoreland tell us that (p must be expressible as a convex combination 
of the Qk density matrices which satisfy the above min-max requirement. There is only 
one permissible density matrix in Case 3, and since, as seen in the diagram for Case 3, 
(j) 7^ we do not have an acceptable configuration. Case 4 at first appears acceptable. 
However, here we do not achieve the maximization in the min-max relation, since we can do 
better by using a relative entropy contour with a larger radii. Case 5 is the ideal situation. 
The relative entropy contour intersects both of the line segment endpoints. Taking a larger 
radius relative entropy contour does not give us permissible gk, since we would obtain Case 
2 with a larger radii. For Case 5, if we moved (p as we increased the relative entropy contour, 
we would obtain Case 3, again an unacceptable configuration. In Case 5, using the two gk 
that lie at the intersection of the relative entropy contour and the channel ellipsoid line 
segment, we can form a convex combination of these gk that equals (f). Case 5 is the best we 
can do, meaning Case 5 yields the largest radius relative entropy contour which satisfies the 
Schumacher- Westmoreland requirements. The value of this largest radii relative entropy 
contour is the HSW channel capacity we seek, Ci. 

We now restate Case 5 in Bloch vector notation. We shall associate the Bloch vector 
V with (j), and the Bloch vectors Wfc with the gk density matrices. For the linear channel, 
from our analysis above which resulted in Case 5, we know that V must lie on the line 
segment between the two endpoint vectors VV+ and VV_ . (Note that from here on, we shall 
drop the tilde ~ we were previously using to denote channel output Bloch vectors, as almost 
all the Bloch vectors we shall talk about below are channel output Bloch vectors. The few 
instances when this is not the case shall be obvious.) 

For a general linear channel, the KRSW ellipsoid channel parameters satisfy 



{t^ ^ , ty ^ , t, , = , Xy = , X, ^ 0}. 

Thus, we can explicitly determine the Bloch vectors VV+ and VV_, which we write below. 



t 



X 



t 



X 



and p- — W_ = 




_tz + Xz_ 
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Note that the tk and Xz are real numbers, and any of them may be negative. 

The Bloch vector V however requires more work. We parameterize the Bloch sphere 
vector V corresponding to (f) by the real number a, specifying a position for V along the line 
segment between yV+ and W-. 



V 



tx 

ty 

tz + aX, 



Here a £ [—1,1]. Now recall that the Schumacher- Westmoreland maximal distance 
property ( see property # I in Section |2]^ ) tells us that D{p^\\(f)) = D{p^\\(l)). To find V, 
we shall apply the formula we have derived for relative entropy in the Bloch representation 
to -D(/9+||0) = D{p-\\(p), and solve for a. The details are in Appendix B. 

4.1 A Simple Linear Channel Example 

To illustrate the ideas presented above, we take as a simple example the linear channel with 
channel parameters : { tx = , ty = , = 0.2 , A^,. = , Aj^ = , = 0.4 }. 
Because the channel is linear with tx = ty = 0, we shall be able to easily solve for V and 
Ci. 

We define the real numbers r_|_ and r_ as the Euclidean distance in the Bloch sphere 
from the Bloch sphere origin to the Bloch vectors W+ and W_. That is, r_|_ and r_ are the 
magnitudes of the Bloch vectors yV+ and W- defined above. For the channel parameter 
numbers given, we find r_|_ = || 0.4 + 0.2 || = 0.6 and r_ = || 0.2 — 0.4 || = 0.2. We 
similarly define q to be the magnitude of the Bloch vector V. 

To find V, we shall apply the formula we have derived for relative entropy in the Bloch 
representation to Z)(p-|-| |i?!>) = D(p^\\(j)), or in Bloch sphere notation, Z)(yV+ ||V) = 
D(yV- II V). The formula for relative entropy derived in Appendix A is : 

-n^ II ^^ ^1 ^^ A ^ ''k . f l + rk \ 1, 2\ • V , fl + 

T^iQkH) = 2 log2 y- - ^k) +^ log2 lyY~^) ~ 2 - 1 ) ^ ^og2 

where 9^ is the angle between Wk and V. Intuitively, one notes that the nearly circular 
relative entropy contours about (p = V tells us that V ~ w+jhW^^ Given the channel 
parameter numbers, this fact about V, together with the linear nature of the channel ellip- 
soid, tell us that 0+ = and 9- = n, so that cos(6'+ ) = 1 and cos(0_ ) = —1. Using 
this information about the 9k, and the identity 

tanh^~"^^[ x] = — log ^ ~^ 
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the relative entropy equality relation between the two endpoints of the linear channel 
can be solved for q. 



Qoptimum — tanh 



i In 



l-rl 



+ r+ tanh*- ^•'[r+] — r_ tanh^ ^^[r 



0.2125. 



Thus, 



W+ = 






0.6 






-0.2 



and V = 



" 


0.2125 



The corresponding density matrices are : 



p+ = i(T + W+.a), p_ = ^(X + >V_.a), 



= -{I + V.a) 



This yields V{p+\\(j)) = D(p_ || ^) = 0.1246. Thus, the HSW channel capacity Ci is 
0.1246. The location of the two density matrices p+ and /9_ are shown in Figure 13 below 
as O. 

Furthermore, the Schumacher- Westmoreland analysis tells us that the two states p+ and 
p- must average to 0, in the sense that if p+ and p- are the a priori probabilities of the 

two output signal states, then + p^ = (f>. In our Bloch sphere notation, this 

relationship becomes p+ VV+ + p- VV_ = V. The asterisk (* ) in Figure 13 below shows 
the position of V. 

Another relation relating the a priori probabilities p+ and p_ is p+ + P- = 1- Using 
these two equations, we can solve for the a priori probabilities p+ and p_. For our example. 



+ p-W- = p+ 


' " 




' " 


= V = 


" " 





+ p- 










.0.6. 




.-0.2. 




.0.2125. 



Solving for p+ and p_ yields p+ = 0.5156 and p_ = 0.4844. 

Note that here we have found the optimum output signal states p-|_ and p_. From these 
one can find the optimum input signal states by finding the states and which map 
to the respective optimum output states and p„. In our example above, these are 



Input 



"0" 


1 



and^'- ^ >V_ 



Input 
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tx = 0.0, ty = 0.0, tz = 0.2, LambdaX = 0.0, LambdaY = 0.0, LambdaZ = 0.4 




Figure 13: The intersection in the Bloch sphere X-Z plane of a hnear channel ellipsoid and 
the optimum relative entropy contour. The optimum output signal states are shown as O. 

For the general linear channel, where any or all of the can be non-zero, we can reduce 
the capacity calculation to the solution of a single, one dimensional transcendental equation. 
( Please see Appendix B for the full derivation. ) 

Define 

rl = tl + tl + {t, + X, )2 
q'^ = tl + tl + {t, + f3\,f 

= tl + tl + {U - K f 

The two quantities r+ and r_ are the Euclidean distances from the Bloch sphere origin 
to the signalling states = yV+ and p_ = VV_ respectively. The quantity q is the 
Euclidean distance from the Bloch sphere origin to the density matrix (j) = V. We define 
the three Bloch vectors r+, q and r_ in Figure 14 below, and refer to their respective 
magnitudes as r+, q, and r_. 
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q 



Bloch 
Sphere 
Origin 



Figure 14: Definition of the Bloch vectors r+ , q, and f_ used in the derivation below. 
We solve the transcendental equation below for j3. 



4 A, {t, + PX, ) tanh(-i)(g) 



= 2r+ tanh(-^)(r+) - 2r_ tanh(-^)(r_) + ln( 1 - rl ) - ln( 1 - ) 



Note that g is a function of /3, while r_|_ and r_ are not. Thus, the right hand side 
remains constant while /? is varied. The smooth nature of the functions of /3 on the left 
hand side allow a solution for (3 to be found fairly easily. 

As in our simpler linear channel example above, we have 











ty 


, W- = 


ty 


, and 


_tz + Xz. 




-tz ~ Xz. 





where /? G ( — 1, 1 ). The corresponding density matrices are : 

p+ = \{T + W+.a), p_ = ^(J + >V_.a), = 
The channel capacity Ci is found from the relations 



tx 

ty 

tz+PXz\ 



-{1+ V.a 



D{p+\\<t>) = D{p_\\<t>) = Xoptvr, 



= Ci . 



The a priori signaling probabilities are found by solving the simultaneous probability 
equations p-\. + p- = 1, and 





tx 




tx 




tx 


p+ 


ty 


+ P- 


ty 


= V = 


ty 




-tz + Xz. 




-tz ~ Xz. 




.tz + PXz. 
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This leads to a second probability equation of pj^ — p- = 



P, yielding: 



1 + (3 



and 



P- 



f3 



4.2 A More General Linear Channel Example 

In the simple linear channel example above, we used { tx = ty = , Xx = Xy = }. 
This choice yielded a rotational symmetry about the Z - axis which assured us the location 
of the optimum average output density matrix p = p^ -\- p- p- was on the Z - axis. 
We used this fact to advantage in predicting the angles 9^^ ^ where 9^_^_ _ _} was the angle 
between W{+^_} and V. Since we knew yV{_|-^_} lay on the Z - axis, we found ^+ = 



and 6^ 



TT, simplifying the cos ( 6 



terms in the relative entropy expressions for 



and In general, we do not have values of it 1 for cos ^^{_|_ _} 

this complicates finding a solution for the linear channel relation D{p^\\(j)) = D{p- 



and 



A more general linear channel example is one where the parameters { tx , ty , tz } 
are all non-zero. Consider the parameter set { tj, = 0.1, ty = 0.2, tz = 0.3, Xx = 
0, Xy = 0, Xz = 0.4 }. Solving the transcendental equation derived in Appendix B yields 

r 0.1 1 

P = 0.0534 and V = 0.2 . Using the density matrix calculated from the Bloch 

L 0.3214 J 

vector V gives us a HSW channel capacity Ci of D{p. 

As discussed above, p+ + p_ = 1, and p+ — p 
p+ = 0.5267 and p_ = 0.4733. 

The optimum input Bloch vectors are : 



= D{p^\\cf>) = 0.1365. 

p. Solving for p^ and p_ yields 



Input 



and if- 



Input 



The optimum output Bloch vectors are : 



Output 



"0.1" 
0.2 
0.7 



and 



P- = 



TjOutput 



■ 0.1 " 
0.2 
-0.1 



Below we show in Figure 15 and Figure 16 the {x, z} and {y, z] slices of the linear 
channel ellipsoid. One can see that the relative entropy curve D{p\\(j)) = Ci = 0.1365 
touches the ellipsoid at two locations in both cross sections. (The {x,y} cross section is 
trivial.) 
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Figure 15: The intersection in the Bloch sphere X-Z plane of a hnear channel ellipsoid and 
the optimum relative entropy contour. The optimum output signal states are shown as O. 



q = 0.39150, D(Rho||Phi) = 0.1365167 




Figure 16: The intersection in the Bloch sphere Y-Z plane of a linear channel ellipsoid and 
the optimum relative entropy contour. The optimum output signal states are shown as O. 



23 



5 Planar Channels 



A planar channel is a quantum channel where two are non-zero, and one is zero. For 
a planar channel, the { tfc } can have any values allowed by complete positivity. A planar 
channel restricts the possible output density matrices to lie in the plane in the Bloch sphere 
which is specified by the non-zero Afc. In comparison to the linear channels discussed above, 
the planar channels additional output degree of freedom (planar has two non-zero A^ versus 
a single linear non-zero A^) means a slightly diff'erent approach to determining Ci than that 
discussed for linear channels must be developed. As for linear channels, we seek to find the 
optimum density matrix = V interior to the ellipsoid which minimizes the distance to 
the most "distant", in a relative entropy sense, point(s) on the ellipsoid surface. We shall 
find the optimum V in two ways : graphically and iteratively. Both approaches utilize the 
following theorem from Schumacher and Westmoreland 

Theorem : 

Ci = Min^ Maxp D{p\\(f>) 

The maximum is taken over the surface of the ellipsoid, and the minimum is taken over 
the interior of the ellipsoid. In order to apply the min max formula above for Ci for planar 
channels, we need a result about the uniqueness of the average output ensemble density 
matrix p = J2k Pk Pk for different optimal ensembles {pk , Pk}- 

5.1 Uniqueness Of The Average Output Ensemble Density Matrix 

The question we address is if there exists two optimum signalling ensembles, { Pk , Pk} and 
{ P'k ^ } of channel output states, whether the two resulting average density matrices, 
P = T,k Pk Pk and p' = 'Zk P'k P'k are equal. 

Theorem : The density matrix (p which achieves the minimum in the min-max formula 
above for Ci is unique. 

Proof : 

From property V in Section 2.2, we know the optimum (j) which attains the minimum 
above must correspond to the average of a set of signal states of an optimum signalling 
ensemble. We shall prove the uniqueness of (j) by postulating there are two optimum signal 
ensembles, with possibly different average density matrices, a and ^. We will then prove 
that a must equal ^, thereby implying (j) is unique. 

Let { Oj , Pi } be an optimum signal ensemble, with probabilities Oj and density matrices 
Pi, where > and = 1- Define a = Q^i Pi- By property I in Section 2.2, we 

know that II 0-) = Xoptimum. = Ci Vi. 

Now consider a second, optimum signal ensemble { [3j , (pj } diff'ering in at least one 
density matrix pi and/or one probability from the optimum ensemble {a^ , Pi}. Define 
C = Z]j f^j 4'j- Consider the quantity J2i on T^i Pi\\ O- Let us apply Donald's equality, 
which is discussed in Appendix C. 
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i i 

Since T>{pi\\a) = Xoptimum Vz, and J2i = 1) we obtain : 

Xoptimum 

i 

Prom property II in Section 2.2, since ^ is the average of a set of optimal signal 
states {/3j, (t>j}, we know that T>{pi\\^) < Xoptimum Vi. Thus ai V{pi\\^) < 
Xoptimum- Combining this inequality constraint on ai T>{pi\\£^ ) with what we know 
about Ylii Oil T^i^pi II 4 ) from Donald's equality, we obtain the two relations : 

E = + Xoptvmum and E < Xoptimum 

i i 

From Klein's inequality, we know that P(a" || ^) > 0, with equality iff a = ^. Thus, 
the only way the equation 

E ai^(/3i|IO = + Xoptimum 

i 

can be satisfied is if we have a = ^, for then P( a || ^ ) = and we have 

^ ^ Q^i^(pi||0 ~ '^(''^110 ~l~ Xoptimum ~ Xoptimum 
i 

and 

^ ] Q; j 2? ( Pi 1 1 ^ ) = ^ ] C^i ( Pi II ''^ ) ^ ] t^i Xoptimum ~ Xoptimum 

i i i 

Therefore, only in the case where cr = ^ is Donald's equality satisfied. Since a and ^ 
were the average output density matrices for two different, but arbitrary optimum signalling 
ensembles, we conclude the average density matrices of all optimum signalling ensembles 
must be equal, thereby implying (j) is unique. 

A - End of Proof. 

Note that although we are primarily concerned with qubit channels in this paper, only 
generic properties of the relative entropy were used in the above proof of uniqueness, and 
therefore the result holds for all channels. 

5.2 Graphical Channel Optimization Procedure 

We shall now describe a graphical technique for finding 4>optimum — '^optimum- 

Recall the 

contour surfaces of constant relative entropy for various values of V shown previously. We 
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seek to adjust the location of V inside the channel ellipsoid such that the largest possible 
contour value Vmax = V^l^ \\V) touches the ellipsoid surface, and the remainder of the 
T^max contour surface lies entirely outside the channel ellipsoid. Our linear channel example 
illustrated this idea. In that example, the T>max contour intersects the "ellipsoid" at r_|_ 
and r_, and otherwise lies outside the line segment between r+ and r_ representing the 
convex hull of A. (Recall from the discussion of the Schumacher and Westmoreland paper 
in Section 2.2 that the points on the ellipsoid surface were defined as the set A, and the 
interior of the ellipsoid, where V lives, is the convex hull of A.) 



A good place to start is with ViniUai 



ty 

tz 



We then "tweak" V as described above 



to find Voptimum- Note that VopUmum should be near ViniUai because of the almost radial 
symmetry of D about V as seen in Figures 2 through 11. 

This technique is graphically implementing property IV in Section 2.2. In Bloch sphere 
notation, we have : 

Ci = Min^ Max^ P ( w || V 

where W is on the channel ellipsoid surface and V is in the interior of the ellipsoid. Moving 
V from the optimum position described above will increase MaXy^ P ^ VV || V ^, since a 
larger contour value of 2? would then intersect the channel ellipsoid surface, thereby increas- 
ing MaXy^ P 1^ VV II V ^ . Yet V should be adjusted to minimize Maxy^ V (^\V \\ V ^ . 

5.3 Iterative Channel Optimization Procedure 

For the iterative treatment, we outline an algorithm which converges to '^optimum- First, we 
need a lemma. 

Lemma : Let V and W be any two Bloch sphere vectors. Define a third Bloch sphere 
vector U as : 

U = {I -a)W + aV 

where a G (0,1). Then 

V{W\\U) < P(W||V) 
Proof : By the joint convexity property of the relative entropy |^ : 

V{{api+ {1 - a)p2}\\{a^i+ {I - a)^2}) < aP(/9i||0i) + (1 - a)P(p2||02) 

where a G (0, 1). Let pi = P2 = ^V, (pi = V and (/)2 = W with U = {1 -a) W + aV. 
We obtain : 

V{W\\li) = V{W\\aV + {1 - a)W) < aV{W\\V) + {1 - a)V{W\\W) 
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But V{W\\W) = 0, by Klein's inequalityg. Thus, 

V{W\\U) < aV{W\\V) < V{W\\V) 

since a £ (0, 1). 
A - End of Proof. 

We use the lemma above to guide us in iteratively adjusting V to converge towards 
Voptimai- Consider P(y\) || V), where VV £ A and V G ;B = the convex huh of A. We 
seek to find Ci in an iterative fashion. We do this by holding V fixed, and finding one 
of the W" G A which maximizes I?(yV|| V). From our lemma above, if we now move 
V towards W, we shall cause Vmax{V) = MaXy^ P(W||V) to decrease. We steadily 
decrease I'max(V) in this manner until we reach a point where any movement of V will 
increase Vmax ( V ) . Our uniqueness theorem above tells us there is only one V optimal ■ Our 
lemma above tells us we cannot become stuck in a local minima in moving towards VopUmai ■ 
Thus, when we reach the point where any movement of V will increase T^max ( ^ ) ? SilQ 
done and have found V final = V optimum- 

To summarize, we find the optimum V using the following algorithm. 

1) Generate a random starting point Vmitiai in the interior of the ellipsoid ( G B ). (In 
actuality, since the contour surfaces of constant relative entropy are roughly spherical about 

tx 

' •) 



V, a good place to start is V,, 



initial 



ty 

t. 



2) Determine the set of points { W } on the ellipsoid surface most distant, in a relative 
entropy sense, from our V. This maximal distance is Vmax ( V ) defined above as Vmax ( V ) = 
Max^, V{W II V). 

3) Choose at random one Bloch sphere vector from our maximal set of points { W }. 
Call this selected point W. In the 3 real dimensional Bloch sphere space, make a small 
step from V towards the surface point vector, W. That is, update V as follows : 

Vneu, = (1 - e)Void + eW 

4) Loop by going back to step 2) above, using our new, updated Vnew, and continue to 
loop until Vmax IS no longer changing. 

This algorithm converges to (poptimum = ^optimum, because we steadily proceed downhill 
minimizing MaXy^ V (^W \\ V j , and our lemma above tells us we can never get stuck in 
a local minima. 

5.4 Planar Channel Example 

We demonstrate the iterative algorithm above with a planar channel example. Let 
{t^ = 0.3 , ty = 0.1 , = Q , Xx = 0.4 , \y = 0.5 , = }. The 
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0.3209- 
0.1112 



0.1994. Shown below in Figure 17 is a plot of the planar channel el- 



iterative algorithm outlined above yields V = 

= 1^ optimum 



and a HSW channel capacity 



lipsoid (the inner curve), and the curve of constant relative entropy T>{p\\(l)) = T> optimum 
centered at V, which is marked with an asterisk *. One can see that the X'mao; curve in- 
tersects the ellipsoid curve at two points, marked with O, and these two points are the 
optimum channel output signals pi. 

The optimum input and output signalling states for this channel were determined as 
described in Appendix E and are : 



Pi = 0.4869, 



Input 



-0.0207 
-0.9998 




-^Output 



0.2917 
-0.3999 




y^^nput ^ 



0.1215 

0.9926 




-^Output 



P2 = 0.5131, 

These signal states yield an average channel output Bloch vector V of 



0.3486 
0.5963 




V = Pi -W, 



l^rOutpUt 



Output 



0.3209 
0.1113 




Figure 17 below shows the location of the channel ellipsoid ( the inner dashed curve ), 
the contour of constant relative entropy ( the solid curve ) for V = 0.1994, the location of 
the two optimum input pure states pj"'^^'', (the two O states on the circle of radius one), and 
the two optimum output signal states p^"*^"*, also denoted by O, on the channel ellipsoid 
curve. Note that the optimum input signalling states are non-orthogonal. 
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tx = 0.3, ty = 0.1 , tz = 0, LambdaX = 0.4, LambdaY = 0.5, LambdaZ = 
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Figure 17: The intersection in the Bloch sphere X-Y plane of a planar channel ellipsoid 
(the inner dashed curve) and the optimum relative entropy contour (the solid curve). The 

two optimum input signal states (on the outer bold dashed Bloch sphere boundary curve) 
and the two optimum output signal states (on the channel ellipsoid and the optimum 
relative entropy contour curve) are shown as O. 

Another useful picture is how the relative entropy changes as we make our way around 
the channel ellipsoid. Wc consider the Bloch X-Y plane in polar coordinates { r , 9}, where 
we measure the angle 9 with respect to the origin of the Bloch X-Y plane axes. ( Note that 
9 only fully ranges over [0, 27r] when the origin of the Bloch sphere lies inside the channel 
ellipsoid. ) The horizontal line at the top of the plot is the channel capacity Ci = 0.1994. 
Note that the two relative entropy peaks correspond to the locations of the two output 
optimum signalling states. 
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Figure 18: The change in P( p\\(f)= * ) as we move p around the channel elhpsoid. The 
angle theta is with respect to the Bloch sphere origin. 



For this channel, the optimum channel capacity is achieved using an ensemble consisting 
of only two signalling states. Davies theorem tells us that for single qubit channels, an 
optimum ensemble need contain at most four signalling states. Using the notation of |14], 
we call C2 the optimum output Ci HSW channel capacity attainable using only two input 
signalling states, C3 is the optimum output Ci HSW channel capacity attainable using 
only three input signalling states, and C4 is the optimum output Ci HSW channel capacity 
attainable using only four input signalling states. Thus, for this channel, we see that 
C2 = C3 = Ci. That is, for this channel, allowing more than two signalling states in 
your optimal ensemble does not yield additional channel capacity over an optimal ensemble 
with just two signalling states. 



6 Unital Channels 

Unital channels are quantum channels that map the identity to the identity : £{T) = T. 
Due to this behavior, unital channels possess certain symmetries. In the ellipsoid picture, 
King and Ruskai Q have shown that for unital channels, the { tfc } are zero. This yields 
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an ellipsoid centered at the origin of the Bloch sphere. The resulting symmetry of such an 
ellipsoid will allow us to draw powerful conclusions. 

First, recall that we know there exists at least one optimal signal ensemble, {pi , Pi}, 
which attains the HSW channel capacity Ci. ( See property III in Section |2.2| . ) Now 
consider the symmetry evident in the formula we have derived for the relative entropy for 
two single qubit density operators. We have : 

V{pU) = V{W\\V) = f{r,q,9) 

where r = \\W \\, q = || V ||, and 9 is the angle between W and V. Thus, if pi £ A 
and (j) e B, with VipiWa) = ^(WjllV) = XopUmum = Ci, then acting in TZ^, 
reflecting pi = Wj and a = V through the Bloch sphere origin to obtain p'^ = W[ and 
a' = V', yields elements of A and B respectively. Furthermore, these transformed density 
matrices will also satisfy V{ p[ = V{ W[ \\V' ) = XopUmum = Ci, because r, q, and 9 

remain the same when we reflect through the Bloch sphere origin. That is, the symmetry 
of the unital channel ellipsoid about the Bloch sphere origin, corresponding to the density 
matrix ^ T, together with the symmetry present in the qubit relative entropy formula yields 
a symmetry for the optimal signal ensemble {pi ■, Pi}, where a = J2i Pi Pi^ or equivalently 
V = J2i Pi This symmetry indicates that for every optimal signal ensemble {pi , Pi}, 
there exists another ensemble, {p'i, p'^}, obtained by reflection through the Bloch sphere 
origin. Since we know there exists at least one optimal signal ensemble, we must conclude 
that if 0" = J2i Pi Pi 7^ then two optimal ensembles exist with a a'. However, 
by our uniqueness proof above, we are assured that a = J2i Pi Pi is a unique density 
matrix, regardless of the states {pi , Pi} used, as long as the states {pi , Pi} are an optimal 
ensemble. Thus we must conclude that a = J2i Pi Pi = ^ ^i since only the density matrix 

maps into itself upon reflection through the Bloch sphere origin. Summarizing these 
observations, we can state the following. 

Theorem : 

For all unital qubit channels, and all optimal signal ensembles {pi, /Oj}, the average 
density matrix a = Pi Pi = \ ^■ 

In Appendix A, it is shown that 

V<^p\\\l'^ = 1 - S{p) 

where S{p ) is the von Neumann entropy of the density matrix p. Thus, our relation for 
the HSW channel capacity Ci becomes : 

Ci = XI Pi'^i^Pi II = 1 ~ X PiSiPi) 

To maximize Ci, we seek to minimize the J2i 'SiPi)^ subject to the constraint that 
the Pi satisfy J2i Pi Pi = ^^i some set of a priori probabilities {pi}- Recall that 
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S{p) = 5(r)isa strictly decreasing function of r, where r is the magnitude of the Bloch 
vector corresponding to p. (Please see the plot below of <S(p) = S{r).) 
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von Neuman entropy 
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Figure 19: The Von Neumann entropy S{p) for a single qubit p 
as a function of the Bloch sphere radius r G [0, 1]. 

Thus we seek to find a set of pi which lie most distant, in terms of Euclidean distance 
in TZ^, from the ellipsoid origin, and for which a convex combination of these states equals 
the Bloch sphere origin. 

Let us examine a few special cases. For the unital channel ellipsoid, consider the case 
where the major axis is unique in length, and has total length 2 A™"'''"'"^**. Let p+ and p_ 
be the states lying at the end of the major axis. By the symmetry of the ellipsoid, we have 



1 1 

2^+ + 2^- 



2 



+ 



Furthermore, the magnitude of the corresponding Bloch sphere vectors r-^- - 
and r_ = II VV_ || are equal, r+ = r_ = 1 — | \^"-3<>raxis y 

Above, we use | • • • | around yriO'Ooraxis because A"*"-'"''"^** can be a negative quantity in 
the King - Ruskai et al. formalism. Using this value of r = r+ = r_ yields for Ci : 



Ci = 1 - 2 



1 



yniajc 
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If the major axis is not the unique axis of maximal length, then any set of convex proba- 
bilities and states {pi , Pi} such that the states lie on the major surface and J2i Pi Pi = 
will suffice. 

Thus we reach the same conclusion obtained by King and Ruskai in an earlier paper j^. 
Summarizing, we can state the following. 

Theorem : 

The optimum output signalling states for unital qubit channels correspond to the min- 
imum output von Neumann entropy states. 

Furthermore, we can also conclude : 
Theorem : 

For unital qubit channels, the channel capacities consisting of signal state ensembles with 
two, three and four signalling states are equal. Furthermore, the optimum HSW channel 
capacity can be attained with a, possibly non-unique, pair of equiprobable {pi = P2 = ^) 
signalling states arranged opposite one another with respect to the Bloch sphere origin. 

Proof : 

Using the notation above, C2 = C3 = C4. From the geometry of the centered 
channel ellipsoid, we can always use just two signalling states with the minimum output 
entropy to convexly reach ^ X. Thus, utilizing more than two signaling states will not yield 
any channel capacity improvement beyond using two signalling states. The equiprobable 
nature of the two signalling states derives from the symmetry of the signalling states on 
the channel ellipsoid, in that one signalling state being the reflection of the other signalling 
state through the Bloch sphere origin means the states may be symmetrically added to yield 
an average state corresponding to the Bloch sphere origin. It is this reflection symmetry 
which makes the two signalling states equiprobable. 

A - End of Proof. 

The last three theorems were previously proven by King and Ruskai in section 2.3 of 
S. Here we have merely shown their results in the relative entropy picture. 



6.1 The Depolarizing Channel 

The depolarizing channel is a unital channel with {t^ = 
cussed in more detail in Appendix D. The parameter x G 
we can conclude that : 



4.x- 



} and { Afc 
[ , 1 ]. Using the analysis above, 
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We plot Ci below. 



Depolarizing HSW Ciiannel Capacity 




0.2 0.4 0.6 0.8 1 

Depoiarizing Channei Parameter x 



Figure 20: The Holevo-Schumacher- Westmoreland classical channel capacity for the 
depolarizing channel as a function of the depolarizing channel parameter x. 

6.2 The Two Pauli Channel 

The Two Pauli channel is a unital channel with { = 0} and {A^- = Xy = x}, and 
{Xz = 2x — 1}, as discussed in more detail in Appendix D. The parameter x G [0,1]. 
The determination of the major axis/surface is tricky due to the need to take into account 
the absolute value of the A^. We plot below the absolute value of the Afc. The dotted curve 
below corresponds to the absolute value of A^: and Ay. The V-shaped solid curve corresponds 
to the absolute value of Xz- 
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0.4 0.6 
Channel Parameter x 



Figure 21: Calculating the length of the major axis of the channel ellipsoid for the two 
pauli channel as a function of the two pauli channel parameter x. 

The intersection point occurs at x = |. Thus Xz is the major axis for x < ^ and the 
{ Aa; , Aj; } surface is the major axis surface for a; > ^. The Bloch sphere radius correspond- 
ing to the minimum entropy states is 1 — 2 a; for x < | and a: for x > | . 

Using our analysis above, we can conclude that for x < |, we have : 



Ci = 1 - 2 



S{r) 



1 - S{\X^\) = 1 - S{1 - 2x ) 



= 1 + X log2 {x) + {1 - X) log2 {1 - X) 

while for X > |, we have : 

Ci = 1 - 2 Q S{r) ^ = 1 - S{\X,\) = 1 - S{x) 

^ ^ ^ log2( 1 + ) + ^ ^ ^ log2( 1 - X ) 



2 ozv . y . 2 

We plot Ci below, using the appropriate function in their allowed ranges of x. 
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Two Pauli HSW Channel Capacity 




0.4 0.6 
Two Pauli Channel Parameter x 



Figure 22: The Holevo-Schumacher- Westmoreland classical channel capacity for the two 
pauli channel as a function of the two pauli channel parameter x. 



Note the symmetry evident in the plots. Examining our graph above for Ci, one sees 
that for < a < 5 , we have Ci(| — a) = Ci(| + 2a). This symmetry is also readily 
seen from the relations for Ci in the two allowed ranges of x (less than and greater than i). 



For X < I, setting x 



a, 



Ci («) = ! + log2 j + log2 

For X > h, setting x = I + 2a, 



^. , , 4 + 6a , / 4 + 6a\ 2 - 6a , /2-6a 
^1 («) = — ^ — log2 — 5 — + — log2 



4 + 6a 2-6a\ 4 + 6a , / 2 + 3a 
+ — - — I + — - — log2 



2 + 3a 
= 1 H 5 — log2 



6 

2 + 3a 



2 — 6a , / 1 — 3a 
log2 



1 — 3a , 
H 5 — log2 



6 

1 — 3a 



Cr(a) 
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7 Non-Unital Channels 



Non-unital channels are generically more difficult to analyze due to the fact that one or 
more of the { tfc } can be non-zero. This allows the average density matrix p = pi pi 
for an optimal signal ensemble {pi, } to move away from the Bloch sphere origin p = 

However, there still remains the symmetry present in the qubit form 



2-^ — 



V 



of the relative entropy formula, namely that = P(W||V) = f{r,q,6), where 

^ = IIWII,^ = ||V||, and is the angle between VV and V. The fact that the qubit relative 
entropy depends only on r, q, and 6 yields a symmetry which can be used to advantage in 
analyzing non-unital channels, as our last example will demonstrate. 



7.1 The Amplitude Damping Channel 

The amplitude damping channel is a non-unital channel with {tx = ty = } and 
{tz = 1 — The Xk are {A^; = Aj, = v^}, and {A^ = ^ }, where ^ is the chan- 
nel parameter, ^ G [ ; 1 ] • The amplitude damping channel is discussed in more detail in 
Appendix D. The determination of the major axis/surface reduces to an analysis in either 
the X-Y or X-Z Bloch sphere plane because of symmetries of the channel ellipsoid and the 
relative entropy formula for qubit density matrices. Since the relative entropy formula de- 
pends only on the r, q and 9 quantities which were defined above, by examining contour 
curves of relative entropy in the X-Z plane, we can create a surface of constant relative 
entropy in the three dimensional X-Y-Z Bloch sphere space by the solid of revolution tech- 
nique. That is, we shall revolve our X-Z contour curves about the axis of symmetry, here 
the Z-axis. Now the channel ellipsoid in this case is also rotationally symmetric about the 
Z-axis, because tx = ty = and = Xy. Thus optimum signal points (points on the 
channel ellipsoid surface which have maximal relative entropy distance from the average 
signal density matrix), in the X-Z plane, will become circles of optimal signals in the full 
three dimensional Bloch sphere picture after the revolution about the Z - axis is completed. 
Therefore, due to the simultaneous rotational symmetry about the Bloch sphere Z axis of 
the relative entropy formula (for qubits) and the channel ellipsoid, a full three dimensional 
analysis of the amplitude damping channel reduces to a much easier, yet equivalent, two 
dimensional analysis in the Bloch X-Z plane. 

To illustrate these ideas, we take a specific instance of the amplitude damping channel 
with ^ = 0.36. Then {t^ = ty = 0} and {t^ = 0.64}. The Afc are {A^; = Xy = 0.6}, 
and {A2 = 0.36}. In this case Ci = 0.3600 is achieved with two equiprobable signalling 

states. The optimum average density matrix has Bloch vector V 
with an asterisk in the plots below. 

In the first plot, we show the X(liorizontal)-Z(vertical) Bloch sphere plane. The outer 
bold dotted ring is the pure state boundary, with Bloch vector magnitude equal to one. 
The inner dashed circle is the channel ellipsoid. The middle solid contour is the curve of 



0.7126 



and is shown 
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constant relative entropy, equal to 0.3600, and centered at V. This relative entropy contour 
in the X-Z plane contacts the channel ellipsoid at two symmetrical points, indicated in the 
plot as O. Note that these two contact points, and the location of V, all lie on a perfectly 
horizontal line. The fact that the line is horizontal is due to the fact that the two optimum 
signalling states in the X-Z plane are symmetric about the Z axis. The point V is simply 
the two optimal output signal points average. The corresponding optimal input signals are 
shown as O's on the outer bold dotted pure state boundary semicircular curve. 




X 



Figure 23: The intersection in the Bloch sphere X-Z plane of the amplitude damping 
channel ellipsoid (the inner dashed curve) and the optimum relative entropy contour (the 
solid curve). The two optimum input signal states (on the outer bold dashed Bloch sphere 
boundary curve) and the two optimum output signal states (on the channel ellipsoid and 
the optimum relative entropy contour curve) are shown as O. 

Note that the optimum input signalling states are nonorthogonal. Furthermore, this 
analysis tells us that C2 = C3 = C4. For the amplitude damping channel, there is no 
advantage to using more than two signals in the optimum signalling ensemble. 

The following is a picture similar to those we have done for the planar channels we 
examined. We plot the magnitude of the relative entropy as one moves around the channel 
ellipsoid in the X-Z plane. The angle 6 is with respect to the Bloch sphere origin (ie : the 
X-Z plane origin). 
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Relative Entropy vs Theta : tX=0, tY=0, tZ=0.64, LambdaX=0.6, LambdaY=0.6, LambdaZ=0.36 




0.5 1 1.5 2 2.5 3 

Theta 



Figure 24: The change in T>( p \\ (p = * ) as we move p around the channel ehipsoid. Tlie 
angle theta is with respect to the Bloch sphere origin. 

Thus, the rotational symmetry about the Z-axis of the relative entropy formula, coupled 
with the same Z - axis rotational symmetry of the amplitude damping channel ellipsoid, 
yields a complete understanding of the behavior of the amplitude damping channel with 
just a simple two dimensional analysis. 

8 Summary and Conclusions 

In this paper, we have derived a formula for the relative entropy of two single qubit density 
matrices. By combining our relative entropy formula with the King-Ruskai et al. ellip- 
soid picture of qubit channels, we can use the Schumacher- Westmoreland relative entropy 
approach to classical HSW channel capacity to analyze unital and non-unital single qubit 
channels in detail. 

The following observation also emerges from the examples and analyses above. In nu- 
merical simulations by this author and others, it was noted that the a priori probabilities 

of the optimum signalling states for non-unital qubit channels were in general, approxi- 
mately, but not exactly, equal. For example, consider the case of linear channels, where the 
optimum HSW channel capacity is achieved with two signalling states. In our first linear 
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channel example, one signalling state had an a priori probability of 0.5156 and the other 
signalling state had an a priori probability of 0.4844. Similarly, in our second linear channel 
example, the respective a priori probabilities were 0.5267 and 0.4733. These asymmetries 
in the a priori probabilities arc due to the fact that D is not purely a radial function of 
distance from Voptimum- The relative entropy contours shown in Figures 2 through 11 are 
moderately, but not exactly, circular about Voptimum- This slight radial asymmetry leads 
to a priori signal probabilities that are approximately, but not exactly, equal. Thus, a 
graphical estimate of the a priori signal probabilities can be made by observing the degree 
of asymmetry of the optimum relative entropy contour about Voptimum- 

In conclusion, the analysis above yields a geometric picture which we hope will lead to 
future insights into the transmission of classical information over single qubit channels. 
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A Appendix A - The Derivation Of The Bloch Sphere Rela- 
tive Entropy Formula 

The relative entropy of two density matrices q and il) is defined to be 

V{Q\\i,) = Tr[Q{\og^{Q) - logaW)] 

Our main interest is when both q and -0 are qubit density operators. In that case, q 
and il) can be written using the Bloch sphere representation. 

^ = ^ (X + W.a) ^ = 1 (x + v.a) 

To simplify notation below, we define 

r = VvV-yV and q = Vv«V 

We shall also define cos(0) as : 

cosffc*) = 

r q 
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where r and q are as above. 

The symbol a means the vector of 2 x 2 Pauh matrices 



where 





"0 


1' 




'0 -i' 




"1 " 




1 


0. 


(^y = 


A . 




-1 



The Bloch vectors VV and V are real, three dimensional vectors which have magnitude 
equal to one when representing a pure state density matrix, and magnitude less than one 
for a mixed (non-pure) density matrix. 

The density matrices for q and -0 in terms of their Bloch vectors are : 



\ + \w3 \wi -\ iW2 



5 + 5^5 



\vi -\ iV2 



\V3 



We shall prove the following formula in two ways, an algebraic proof and a brute force 
proof. We conclude Appendix A with some alternate representations of this formula. 



V{Q\\i,) = Vi-V2 = ^log2(l - +iiog2(^^) - I log2(l 



Q I ^7— log2 



2q 



1 + 



llog,(l-r^) + ^log,(i^)-ilog,(l-,= ) 



r cos(^) , / 1 

^ log2 



1 - q 



where 9 is the angle between W and V. 

A.l Proof I : The Algebraic Proof 

V{g\\^l^) = TriQi log^ig) - log^W ) 
Recall the following Taylor series, valid for || x || < 1. 



ln( 1 + a; ) = 



- 1^ = X - 



n=l 



n 



2 X^ 

2^Y 



4 5 

X X^ 



x^ x7 
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ln( 1 - a; ) 



n=l 



X 



n 



X 



X 

T 



X 



X 

Y 



Combining these two Taylor series yields another Taylor expansion we shall be interested 



in : 



^ {ln(l + x) - HI - x)} = \ ln(|^ 



E 

n=l 



{-xr 



n 



Ex 



n=l 



n 



tZ/^ t^T^ 



A different combination of the first two Taylor series above yields yet another Taylor 
expansion we shall be interested in : 



^ { ln( 1 + X ) + ln( 1 - X ) } = ^ In 



1 - x' 



= -E 



-xY 



+ 



n=l 



n 



-E 



X 

n 



X 

T 



X 

T 



Consider log( g ) with the Bloch sphere representation for g. 



X 



X 



We obtain, using the expansion given above for log( 1 + x), 



log( g) = log 



= log 



+ log 



= log 



E 

n=l 



n 



Recall that where r • W. Thus we have for even n, 

^ VV • ^ = r", while for odd n we have (^W • =r"~^ W»a. The expression 
for log( g ) then becomes 



log( g) = log 



E 

n=l 



n 
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= log 



11 4 4 fi fi 



= log 



+ 



>V»<T 



r ^ \ H h 

3 5 7 



+ 



r 



^10 

To 



= log 



+ log 

2 r 



1 + r 
1 - r 



log 



1 - r' 



To evaluate Tr [ q log( ) ] we again use the Bloch sphere representation for q. 



We write 



rr[e log(^)] = \ tt\X • Iog(^)] + ^Tr [ (vP.a) log( ^ ) 



Using our results above, 

^ Tr [ J • log( ^ ) ] = log 

since Tr\X ] = 2 and Tr[ cxa: ] = Tr\ Uy ] = Tr[ Oz 
Similarly, 



" 1 ■ 






. 2 . 


+ \ log 


■ 1 - " 



(W.a) log(i? 



log 



1 + r 



r log 



1 + r 



where we again used the fact Tr\ I] = 2 and Tr[ ] = Tr[ ay ] = Tr[ az] = 0. 
Putting all the pieces together yields : 



Tr [ Q log( q)] = ^Tr[I • log( g)] + ^Tr 



log 



+ 2 log 



1 



+ 2 log 



(W.a) log(^) 
1 + r 



To evaluate Tr[ g log( f/' ) ], we follow a similar path and use the Bloch sphere repre- 
sentation for ip of 



= ^ (X + V.a) 
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The expression for log( ijj ) then becomes 



log( V ) = log 
Using our results above, 



" 1 ■ 




" 1 


+ q 1 












+ ^ log 


1 - q'' 


. 2 . 


2 q 


1 


- q . 







1 



Tr [ J • log( V ) ] = log 



" 1 ■ 








+ I log 


1 - q'' 


. 2 . 







log[2] + - log 



Tr 



(>V«a) log(V) 



log 



1 - q 



r cos( ^ ) log 



1 + q 
1 - q 



where we again used the fact Tr[I ] = 2 and Tr[ ax ] = Tr[ay ] = Tr[az 
We also used the fact that 



= 0. 



and therefore 



Tr 



Tr 



+ Tr 



( V X VV) 



= (V.VV) Tr[I] + (VxVv) • Tr[a] = 2V*W 
Assembling the pieces : 



Tr[ g log( V ) ] = ^ Tr[T . log( V ) ] + ^Tr 



log 



" 1 ■ 








+ ^log 


1 - q'' 


. 2 . 







(yP.a) log(V) 
1 + q 



1 - q 



Using these pieces, we obtain our final formula : 

V) = Tr[ e(log2(^) - log2(V))] 



= log2 
log2 



+ \ log2 [ 1 - 



+ 2 ^°S2 



- log2 [1 - q^ 



- cos( 6* ) log2 



1 + r 
1 — r 

1 + 9 
1 - q 
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log2 



r , 
+ 2 



1 + r 
1 - r 



log2 



cos( ) log2 



1 + g 

1 - q 



which is our desired formula. 
A - End of Proof I. 

A.2 Proof II : The Brute Force Proof 

The density matrices for q and ^ in terms of their Bloch vectors are : 



\ + \w3 \wi — iW2 
^wi +1 iW2 ^- ^Ws 



\vi -\ iV2 



The eigenvalues of these two density matrices are : 



A(2) 



2 + 2 yw2^-W3^+w^ 



- - - Vw^^^+w^^+w^ 



V - 

v(2) 



2 + -^Vv2^ + Vs^ + Vi^ 



1 1 



2 - ^VV2^ + Vs"^ + Vi"^ 



1 + r 
2 

1 - r 
2 

1 + q 
2 

1 - q 



We shall also be interested in the two eigenvectors of ijj. These are : 



|ei) = ATi 



where A^i is the normalization constant given below. 



^ + V W2^ + W3^ + Wl'^Ws + W2'^ + Ws'^ 



Wl 



Similarly, 
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1 62) = N2 



+WS wi —iws W2 



Wl ^ + W2^ 
1 



N2 



We wish to derive a formula for T>{g \\ tp) in terms of the Bloch sphere vectors W and 
V. We do this by breaking 'D{q \\ ip) up into two terms, Pi and 'D2. 

P(^IIV) = Pi - P2 
We expand Pi using our knowledge of the eigenvalues of g. 

Pi = Tr[ ^ log2(^)] 
= A« log2(A«) + log^m 



1 + r \ f 1 + r 
log2 



= - 1 + ( ^ t ) log2 (1 + r) 



1 — r\ /I — r 
log2 



1 - r 



log2(l - r) 



One notes that Pi = —S{q), where S{q) is the von Neumann entropy of the density 
matrix g. The second term, P2, is P2 = Tr[ g log2(V') ]• We evaluate P2 in the basis 
which diagonalizes ip. 



P2 = Tr[ elog2(^)] = log2(AW) Tr[^ |ei)(ei|] + log2(A(2) ) Tr [ ^ |e2) (e2| 
We use the Bloch sphere representation for g in the expression for P2. 



^ = -{I + Va) 



P2 = Tr[g log^W] = 



I log2(A^'^: 



Tr[ |ei) (ei| ] + V Tr[ Oi |ei) (ei| 



+ 
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First note that rr[ |ei) (ei| ] = Tr[ |e2) (e2 
operators. 

Next define 



Tr[ |e2) (e2| ] + ^ Wi Tr[ |e2) (e2| ] 

= 1 since the \ej) are projection 



a 



(i) 



Tr[ai |ej) (ej|] = (e^l ai \ej) 



Evaluating these six ( i = 1,2,3 and j = 1,2 ) constants yields : 



a 



(1) 



vi{q + 

V2{q + V2, 



a 



(1) 



Q 

V3 



Vi^ + y/v2^ + Vg"^ + + i)^2 _|_ ^^2 



a 



(2) _ 



.,(2) 



Oo = 



_ 



V2 



a 



(2) 



-Vi"^ + \/v2^ + Vs'^ + Vi'^V3 — V2'^ — 

+ ■Uj^ + Vi'^ - V3^ V3 



— + V'l'S^ + '^'5^ + Vi'^V3 — V2'^ — 

Putting it all together yields : 

V2 = Tr[^ log2(V)] 



g2 + 


qv3 


Vl ( q 


- vs ) 


- 


qv3 


Vl ( q 


- V3 ) 


q' - 


qv3 


V3 ( q 


- V3 ) 


q' - 


qv3 



Vl 

q 



V2 

q 



V3 



1+ E 



Wi a. 



(1) 



+ 



1 + 1: 



Wi a - 



(2) 
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1 + 2^ Wi — 



1 

2 



1 + 



log. (aJO + ^ 
log. (A^^O ^ ^ 



1+ E 



Wi 



log. (aJ,^0 



log. (a;^0 



Plugging in for the eigenvalues A^^^ and which we found above yields : 

V2 = Tr[g log2(V') ] 



1 
2 



1 + 



>V»V 



log. 



1 + q 



+ 



Q 



log. 



= 2log2(l-.^)-l + ^^log.(3^ 
Putting all the pieces together to obtain V{g\\tp), we find 



P(^IIV) = V1-V2 = ^log2(l-r2) + Jlog2 



1 + r\ 1 
1 - r j ~ 2 



log 



= o log2(l - '^^) + ^ log. 



1 + r\ 1 , /, 2 X cos(6') + ^ 



where 9 is the angle between W and V. 
A - End of Proof II. 

Ordinarily, D(p||0) / However, when r = q, we can see from the above 

formula that V{p\\(p) = V{cj)\\p). 

A few special cases of are worth examining. Consider the case when 4> = ^I. 

In this case, q = 0, and 

= \ log.(l-r^) + ^ log.(^) 



1 + r /l + r\ l + r 1-r /l-r\ l-r 

log. + + log. + = 1 - S{p) 



Thus, I)( p II ^ J ) = 1 - S{p), where <S(/9) is the von Neumann entropy of p, the first 
density matrix in the relative entropy function. 
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B Appendix B - The Derivation Of The Linear Channel 
Transcendental Equation 



In this appendix, we derive the transcendental equation for determining the optimum po- 
sition of the average density matrix for a Hnear channel. The picture of the quantities we 
shall define shortly is below. 




Figure 25: Definition of the Bloch vectors r+, g, and r_ used in the derivation below. 

We assume that in general all the { t ^ 7^ 0} . Wc also assume the linear channel is 
oriented in the z direction, so that Xx = = 0) but / 0. We define 

A = tl + tl + {t, + Kf = rl 
B = tl + tl + {U + (5\,f = q{f3f 

C = tl + tl + {U-Kf = rl 

The three quantities above refer respectively to the distance from the Bloch sphere origin 
to r+, the optimum point q we seek, and r_. We define the three Bloch vectors r+, q and 
r_ in Figure 25 above, and refer to their respective magnitudes as r^, q, and r_. Here 
(3 G [—1, 1], so that q can range along the entire line segment between and r_. 

As discussed in the Linear Channels section of this paper, the condition on q is that 
V{rj^\\q) = P(r_||(/). 

Now recall that 

^/ „ N 1 , /-, 2^ r ^ fl + r\ 1 , 2^ r cosiO) , fl + q\ 
V{r\\q) = = - log2(l - r') + 2 log2 (r^j " 2 ^""^^^^ ~ 2 Vr^j 
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where 9 is the angle between r and q. To determine 9, we use the law of cosines. If is 
the angle between sides a and b of a triangle with sides a, b and c, then we have : 

+ 62 _ ^2 

= 2^6 

Our condition I?(r+||g) = I?(r_||g) becomes : 

1 1 2n , ^ , + r+\ cos(0+) /I + 

- log(l - H) + - log ^ log ' 



2"' ^' 2 ° \ l - r+J 2 °V1 

1 1 /i 2\ , ^ fl + r-\ r- cos(6'_) , fl + q 
- log(l - ri) + - log log ' 



2 °' 2 ° VI - 2 VI - 

where we canceled the term which was identically a function of q from both sides, and 
converted all logs from base 2 to natural logs by multiplying both sides by log(2). 

Determining ^_|_ and 9-, we find : 



cos(0_ 



2qr+ 

rl + q^ - ((1 +/3)A,^2 



2qr- 

Next, recall the identity 

tanh(-i)[x] = llog(i^) 

Using this identity for arctanh, our relative entropy equality relation between the two 
endpoints of the linear channel becomes : 

llog(l -A) + VStanh<-') (v/I) - ^ + B^-gi - fi) X,f) 

= 1 ,„g(i _ C) + v/Ctanh(-) ( VC) - ^/C(C + B-((l + fflA.)-) .^^C-) 

2 ^ ' 2v B C ^ ■ 

We can cancel several terms to obtain 

^ log(l - A) + Vltanh(-i) (VI) - tanh(-i) (Vs) 

= i log(l -C) + tanh(-i) (x/c) - tenh(-i) (x/s) 
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which in turn becomes 



1 log(l - A) + ^ tanh(-i) - ~ ^J^^^^'^ tanh(-i) [Vb) 

= ^ log(l - C) + tanh(-^) (\/C7) 

Using our definitions above for A and C, we find that A - C = AX^tz- Substituting this 
into the relation immediately above yields : 

1 iog(l -A) + VA tanh(-i) (VI) - +_4/3A^) ^^^^(-i) |^^^ 

= ^log(l - C) + tanh(~i) (Vc) 
which we adjust to our final answer : 

^^-(^-+ tanh(-i) i^) 

Vb ^ ^ 

= log(l - A) - log(l - C) + 2VItanh(-i) (VI) - 2VC tanh^^^) (Vc) 

Note that B is a function of /?, so the entire functionality of P lies to the left of the 
equality sign in the expression above. All terms on the right hand side are functions of the 
{tfc} and {A^}, so the right hand side is a constant while we vary p. Since all the functions 
of P on the left hand side are smooth functions, the search for the optimum P = q, although 
transcendental, is well behaved and fairly easy. 



C Appendix C - Donald's Equality 

We prove Donald's Equality below[^]. Let pi be a set of density matrices with a priori 
probabilities Qj, so that Qj > V i and J2i = 1- Let cp be any density matrix, and 
define a = J2i Pi- Then : 

^ aiD{pi\\(t)) = D{a\\(t)) + ^ aiD{pi\\a) 

i i 

Proof : 
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Y^c^iDiPiU) = E «^ {Tr[pi\og{pi)] - Tr[pi\og{(t>)]} 

i i 

= E {Tr[pi log(ft)] } - Tr[alog(0)] 



= { Tr [ (7 log( (7 )] - Tr[a\og{a)} - Tr[a \og{(t>)] + aiTr[pi\og{pi)] 

i 

= D{a\\<\>) - rr[alog(c7)] + ^ «i ^'^[Pi log(pO ] 

i 

= D(a||0) + E {rr[pi log(pi)] - rr[pi log(a)]} 
j 

i 

A - End of Proof. 

D Appendix D - Quantum Channel Descriptions 

The Kraus quantum channel representation is given by the set of Kraus matrices A = 
{ Ai} which represent the channel dynamics via the relation : 

Sip) = E ApA] 

i 

The normalization requirement for the Kraus matrices is : 

E = I 

i 

A channel is unital if it maps the identity to the identity. This requirement becomes, 
upon setting p = I : 

E^.p4 = E^^4 = I 

i i 

Each set of Kraus operators, { A } can mapped to a set of King-Ruskai-Szarek- Werner 
ellipsoid channel parameters { tfe , }, where k = 1, 2, 3. 
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The Two Pauli Channel Kraus Representation 



■X 
A/i 



An 



A. 



l-x 



In words, the channel leaves the qubit transiting the channel alone with probability x, 
and does a Ux on the qubit with probability or does a ay on the qubit with probability 
^-5-^. The Two Pauli channel is a unital channel. The corresponding King-Ruskai-Szarek- 



Werner ellipsoid channel parameters are 
= 2x - 1. § Here x G [0,1]. 



0, and At 



A„ 



X, while 



The Depolarization Channel Kraus Representation 



A, 



X 

^/^ 



A, 



A, 



A, 



1 — X 



l-x 







l-x 



l-x 



In words, the channel leaves the qubit transiting the channel alone with probability x, 

- or does a ay on the qubit with probability 
The Depolarization channel is a unital 



and does a ax on the qubit with probability - 3 
^ ~ ^ or does a az on the qubit with probability - 



channel. The corresponding King-Ruskai-Szarek- Werner ellipsoid channel parameters are 



tx 



tz = 0, and A^. 



4x - 1 



Again x e [0,1' 



The Amplitude Damping Channel Kraus Representation 



A, 





1 



Ao 











In this scenario, the channel leaves untouched a spin down qubit. For a spin up qubit, 
with probability x it leaves the qubit alone, while with probability l-x the channels flips 
the spin from up to down. Thus, when x = 0, every qubit emerging from the channel is 
in the spin down state. The Amplitude Damping channel is not a unital channel. The 
corresponding King-Ruskai-Szarek- Werner ellipsoid channel parameters are tx = 0, ty = 
0, tz = 1 — X, Xx = \/x, Xy = ^Jx, and A^ = x. Q Again x G [0,1]. 
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E Appendix E - Numerical Analysis Of Optimal Signal En- 
sembles Using MAPLE and MATLAB 



The iterative, relative entropy based algorithm outlined above was implemented in MAPLE, 
and provided the plots and numbers cited in this paper. In addition, numerical answers 
were verified using a brute force algorithm based on MATLAB's Optimization Toolbox. The 
MATLAB optimization criterion was the channel output Holevo x quantity. Input qubit 
ensembles of two, three and four states were used. After channel evolution, the output 
ensemble Holevo x was calculated. With this function specified as to be maximized, the 
MATLAB Toolbox varied the parameters for the ensemble qubit input pure states and the 
states corresponding a priori probabilities. Pure state qubits were represented as : 

IV) = 

thereby requiring two parameters, { a , }, for each input qubit state. Thus a two state 
input qubit ensemble required an optimization over a space of dimension five, when the a 
priori probabilities are included. Three and four state ensembles required optimization over 
spaces of dimension eight and eleven respectively. 



a 
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